Methods and Materials for Generating Genetic Disruptions in 

Bacterial Cells 



5 The present application claims priority to US Provisional 
Application 60/399,751 filed July 31, 2002, the entire 
disclosure of which is incorporated by reference herein. 

FIELD OF THE INVENTION 

10 The present invention relates to methods and materials for 

generating genetic disruptions in bacterial genetic material, 
especially genetic disruptions in genetic material of 
Streptomyces spp. . 

15 The most preferred genetic material for study is that of 

S .coelicolor M145, a plasmid-free (SCPl" SCP2") derivative of the 
wild type S .coelicolor A3 {2) strain. Streptomyces coelicolor 
A3 (2) is genetically the most studied Streptomyces species. It 
is for this reason that the entire single, linear, 8,800 kb 

20 chromosome of S. coelicolor A3 (2) was sequenced (using 

S .coelicolor M145) at the Sanger Centre. 7825 genes were 
predicted, with an average gene size of 1.1 kb. 53% of these 
7825 genes have no known function. Most of these genes are 
probably non-essential for growth under normal laboratory 

25 conditions. It is of great interest to study these genes, and 
to this end it is of interest to generate mutants containing 
disruptions in different non-essential genes (and/or their 
control sequences) , resulting in mutations (often knock-out 
mutations) of those genes. 

30 

Once an interesting mutation has been identified, it may also be 
of interest to introduce it into other species or strains, to 
investigate its effect in different genetic backgrounds. 
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It may also be of interest to study the control of genes 
containing interesting mutations. 

In the work leading to the present invention, the inventors have 
5 constructed a novel transposon, designated Tn5062, which 

contains an origin of transfer, a three frame translational stop 
sequence, an antibiotic resistance marker and a promoterless 
copy of the enhanced green fluorescent protein gene (EGFP) 
between Tn5-like inverted repeat sequences. 

10 

Transposition of this novel transposon was performed in vitro 
into cosmid DNA from an S. coelicolor cosmid library, using the 
in vitro transposition protocol of Epicentre (Madison, WI, USA) , 
and the transposed cosmid DNA was used to transform E. coli 
15 cells. Transposition target sites from different transposition 
events were determined by sequencing using transposon- specif ic 
primers and transposed cosmid DNA as the template. 

Replacement of a wild-type gene with a disrupted, cosmid-borne 
20 copy was effected by conjugation from E. coli, followed by 

homologous recombination, and determined by marker replacement. 

The methods and materials of the invention and various preferred 
embodiments offer certain benefits, which are not provided by 

25 other previously known mutagenesis techniques that use 

transposons (such as that disclosed in PCT/GB02/02884) . In 
particular, the inclusion of a step of conjugating transposed 
DNA allows the same mutation to be transferred into multiple 
genetic backgrounds (e.g. different actinomycete species or 

30 strains) ; it also allows more convenient identification of the 
site of transposition than if transposition were carried out in 
the host cell; it also allows more flexibility in storage of the 
mutation, e.g. as purified cosmid DNA or in E. coli cells; it is 
also advantageous in the absence of a reliable generalised 

35 transduction system (as is the case for Streptomyces , although 
electroporation has been demonstrated for some species) . The 
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invention is preferably applied to the mutagenesis of DNA from a 
species whose genome has been sequenced (such as S. coelicolor) 
or a related species. 

5 Accordingly, in a first aspect, the invention provides a nucleic 
acid construct comprising inverted repeat sequences of a 
transposable element and an origin of transfer, wherein the 
origin of transfer lies between the inverted repeat sequences, 
such that a transposition event involving the inverted repeat 
10 sequences will result in the origin of transfer being included 
in the resultant insertion at the transposition target site. 

In a second aspect, the invention provides a method for 
mutagenising DNA of interest from a bacterial species, the 
15 method comprising: 

(a) contacting said DNA of interest with a nucleic acid 
construct of the invention, to form a transposition mixture; 

(b) incubating the transposition mixture under conditions 
suitable for transposition to occur, said contacting and 

20 incubating steps being performed other than within cells of said 
bacterial species ; 

(c) transferring transposed DNA of said transposition 
mixture by conjugation from a donor bacterial cell into a host 
bacterial cell; and 

25 (d) incubating the host cell under conditions suitable for 

homologous recombination between the transposed DNA and the DNA 
of the host cell. 

In practice, the method will generally involve the use of 
30 multiple donor and host cells. 

For the avoidance of doubt, it is hereby stated that the term 
"inverted repeat sequences" refers to the short (typically about 
10-40 bp) terminal inverted repeat sequences of an insertion 
35 sequence (IS element) or class II transposon, which interact 

with a transposase to mediate transposition. It does not refer 
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to an entire IS element from a class I transposon. Class I 
transposons consist of one or more structural genes flanked on 
either side by an IS element (which may be identical or 
different) • For example, Tn5 consists of two IS elements 
5 (designated "IS50L" and ''IS50R") flanking various structural 
genes. Each IS element of a class II transposon therefore 
includes inverted repeat sequences. 



Especially preferred inverted repeat sequences are the 19 bp 
10 ''Mosaic Ends" of the EZ::TN ™ system of Epicentre, as used in 

the Example and labelled ''OE-L" and ''OE-R" in Fig. 1, having the 
sequence : 

5'-CTGTC TCTTA TACAC ATCT-3' 

3'-GACAG AGAAT ATGTG TAGA-5' 
15 These specific inverted repeat sequences are recognised by the 
well-known and well-characterised hyperactive mutant TnS 
transposase for high frequency transposition. This mutant 
transposase is commercially available (e.g. from Epicentre, as 
EZ: :TN ™ transposase) . 

20 

However, it is contemplated that many other inverted repeat 
sequences may be used in the practice of the invention. The 
skilled person will be aware of other transposons, the inverted 
repeat sequences of which may also be used, in conjunction with 

25 a transposase enzyme capable of recognising the inverted repeat 
sequences (see e.g. Singleton and Sainsbury (1987) Dictionary of 
Microbiology and Molecular Biology, 2nd edition, John Wiley & 
Sons, under the entry '^transposable element" and other entries 
referred to therein, as well as references cited in those 

30 entries, e.g. the review Grindley and Read (1985) and Shapiro 

(ed) (1983) ''Mobile Genetic Elements", Academic Press; see also 
Berg and Howe (1989) and Kieser et al (2000)) . Indeed several 
transposon systems are commercially available, each comprising a 
transposon and a transposase capable of recognising and 

35 mediating transposition of the transposon. These and other 

publicly known transposons could readily be adapted for use in 



accordance with the invention. Examples of commercially 
available systems include the Genejumper*^ system of Invitrogen 

Corporation (Carlsbad, CA, USA) , based on the bacteriophage \xA 

transposon; the \\A transposon system of Finnzymes Oy (Espoo, 
5 Finland) ; and the GPS system of New England Biolabs (Beverly, 
MA, USA), based on the Tn7 transposon. 

Generally, the inverted repeat sequences and transposase will 
originate from the same transposon or related transposons . Also 
10 contemplated and within the scope of the invention are variant 
inverted repeats and/or transposases, which remain capable of 
interacting with each other to mediate transposition. 



Generally preferred inverted repeat sequences are, or are 
15 derived from, the OE and/or IE inverted repeat sequences of the 
transposon Tn5 (from which the EZ::TN ™ Mosaic Ends are 
themselves derived) . The OE sequence is 5'-CTGAC TCTTA TACAC 
AAGT; the IE sequence is 5'-CTGTC TCTTG ATCAG ATCTT GATC. Tn5 
has the most random insertion pattern of known transposons. 
20 This property is shared by the EZ: :TN ^ Mosaic Ends. Such 
inverted repeats will generally be used with native Tn5 
transposase (though this is not suitable for in vitro 
transposition) or, preferably, the commercially available 
hyperactive mutant TnS transposase (e.g. of Epicentre), which is 
25 suitable for in vitro use. The mutant TnS transposase is 

capable of recognising both the Mosaic Ends and the wild-type 
TnS inverted repeat sequences . 

TnS- like inverted repeat sequences preferably display at least 
30 70%, 75%, 80%, 85%, 90% or 95% sequence identity with any one or 
more of the TnS OE sequence, the TnS IE sequence and the Mosaic 
Ends, a comparable level of identity to that shown between the 
OE sequence and the Mosaic Ends (16 identical bases out of 19) 
and between the IE sequence and the Mosaic Ends (15 identical 
35 bases out of 19 in the shorter sequence) . Percentage sequence 
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identity is defined as the percentage of nucleic acid residues 
in the shorter of the sequences under comparison that are 
identical to corresponding nucleic acid residues in the other 
sequence, when the sequences are aligned. Up to a total of 5 
5 gaps may be included in one or both sequences to optimise the 
alignment. Like the Mosaic Ends, a Tn5-like sequence may be a 
hybrid of the OE and IE sequences, each residue of the Tn5-like 
sequence being selected from the residues found at the 
corresponding position in either the IE or the OE sequences. 

10 

The inverted repeat sequences preferably do not demonstrate high 
target site specificity. This provides the advantage of 
allowing essentially untargeted gene disruption to occur, rather 
than bias towards disruption at a limited number of locations 

15 possessing the relevant specific target site. The Tn3 and ISI 
transposable elements show preference towards AT-rich regions. 
Such target site specificity is not regarded to be high for the 
present purposes and transposable elements having Tn3-like or 
ISI- like inverted repeat sequences are within the scope of this 

20 preference. Both Tn3 and TnIO transposable elements display 
hotspots for insertion, at which insertion occurs with greater 
frequency than at other locations, so some bias towards these 
hotspots might be expected. Hotspots for Tn3 have homology with 
the Tn3 inverted repeat sequences; those for TnlO bear no 

25 obvious relationship to the TnIO inverted repeat sequences. 

Such target site specificity is also not regarded to be high for 
the present purposes and transposable elements having Tn3-like 
or TnlO-like inverted repeat sequences are also within the scope 
of this preference. By contrast, the IS5 transposable element 

30 inserts only at sites containing the target site C(T/A)A{G/A) . 
This level of specificity is not within the scope of this 
preference . 

Particularly when transposition is intended to occur within a 
35 bacterial cell (see below) , the construct may encode all the 
functionalities necessary for transposition to occur, e.g. a 
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transposase (which will usually originate from the same 
transposable element as the inverted repeat sequences) . Some 
transposable elements, which transpose by a replicative 
mechanism, also require a resolvase gene and an internal 
5 resolution site for transposition to occur. These may be 
included if required. 

Preferably, however, the construct is intended for use in 
transposition in vitro, i.e. outside any bacterial cell. In 

10 such cases, it preferably does not encode a transposase. 

Rather, the transposase protein is added to the transposition 
reaction mixture. Again, the transposase will usually originate 
from the same transposable element as the inverted repeat 
sequences. Preferred transposase is the commercially available 

15 hyperactive mutant TnS transposase. 

Many sequences for the TnS transposase are available, e.g. via 
GenBank. Goryshin and Rezinkoff (1998) describes a hyperactive 
form of the transposase, on which the EZ;:TN ™ transposase 
20 system is based. See also Davies et al. (2000), which reports 
the 3D structure of the enzyme-DNA complex. 

Particularly when the construct is intended to be used for 
transposition in vitro, the inverted repeat sequences are 
25 preferably of, or derived from the inverted repeat sequences of, 
a transposable element that employs a non-replicative (e.g. a 
cut-and-paste) transposition mechanism. The TnS transposon 
replicates in this manner; a further example is TnlO. 

30 A transposition event involving the construct of the invention 
will lead to the insertion of the origin of transfer into the 
transposition target site. If the transposition target site is 
in a circular DNA molecule, such as a cosmid or other plasmid, 
the circular DNA molecule can then be mobilised from one 

35 bacterial cell (the donor cell) into another bacterial cell (the 
host cell) by conjugation. The DNA of interest is therefore 



preferably contained in one or more circular DNA molecules 
(typically a large number of such molecules) , such as a cosmid 
or other plasmid. 



5 Preferably the origin of transfer is an oriT which can be 

mobilised by the helper plasmids pUZ8002 and pUB307, such as an 
oriT from an IncP-group plasmid, such as RP4 (also designated 
RP1/RK2; Pansegrau et al., 1994), preferably having the nucleic 
acid sequence: 

10 CCGGGCAGGA TAGGTGAAGT AGGCCCACCC GCGAGCGGGT GTTCCTTCTT 

CACTGTCCCT TATTCGCACC TGGCGGTGCT CAACGGGAAT CCTGCTCTGC 
GAGGCTGGC , 

or a variant thereof having origin of transfer function. 
However, the use of any other suitable origin of transfer is 
15 also contemplated. 

Preferably the construct comprises a selectable marker gene 
(such as an antibiotic resistance gene) , located between the 
inverted repeat sequences. The presence of a selectable marker 
20 in the insertion following a transposition event allows 
convenient identification of transposed DNA. 

Preferably the construct comprises a reporter gene (preferably a 
promoter less reporter gene) , located between the inverted repeat 

25 sequences. Insertion of a promoterless reporter gene into a 

target site downstream of a promoter will allow analysis of gene 
expression under the control of that promoter. Preferably the 
promoterless reporter gene is operatively associated with a 
ribosome binding site. Preferably the construct further 

30 comprises, upstream of the reporter gene and ribosome binding 
site and between the inverted repeat sequences, a translational 
stop sequence (preferably a three- frame translational stop 
sequence) . Expression of a fusion protein (of the partial gene 
product of the mutagenised gene fused to the reporter gene 

35 product, or a nonsense product of the reporter gene) may 

interfere with expression of the reporter gene from its own 
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ribosome binding site. This will be prevented by the presence 
of the translational stop sequence. 

Preferred reporter genes are visible or visualisable . 
5 Especially preferred are genes for fluorescent proteins, e.g. 
those commercially available from BD Biosciences Clontech (Palo 
Alto, CA, USA, a division of Becton Dickinson, Franklin Lakes, 
NJ, USA) , such as enhanced green fluorescent protein (EGFP) . 

10 The construct may be linear, and may consist essentially of the 
inverted repeat sequences and any sequences located therebetween 
(i.e. the inverted repeat sequences may lie essentially at 
respective termini of the linear nucleic acid molecule) , 
optionally with short sequences outside the inverted repeat 

15 sequences (e.g. sequences containing PGR primer binding sites 
and/or restriction sites, particularly sequences of 100 bp or 
less, 75 bp, 50 bp, 30 bp, 20 bp, 15 bp or lObp) . The 
construct may in particular lack an origin of replication. 

20 The construct may, however, be included within a vector, e.g. a 
plasmid- Typically, such a vector will include convenient PGR 
primer binding sites and/or restriction sites for the 
amplification or excision from the vector of a linear nucleic 
acid consisting essentially of the construct. In preferred 

25 embodiments of the invention, the construct will be intended for 
use, as such a linear molecule, in an in vitro transposition 
reaction. Apart from (and outside of) the construct of the 
invention, the vector may typically include an origin of 
replication and/or a selective marker. 

30 

The contacting and incubating steps (a) and (b) of the method of 
the invention may occur inside the donor (e.g. E. coli) cell, 
especially by transferring (e.g. by electroporation) the 
construct of the invention into cells containing a DNA of 
35 interest (e.g. E. coli cells containing an actinomycete cosmid) . 
A preferred construct of the invention does not encode 
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transposase, but may be pre -incubated with transposase to form a 
stable complex (referred to as a ''transposome" in Epicentre 
literature) , which may then be transferred into the donor cell 
(again, e.g. by electroporation) for the contacting step. 
5 Transposase enzymes typically require the presence of a metal 
ion (e.g. Mg^"" for Tn5 and the EZ::TN system) for transposition 
to occur. The pre -incubation step will therefore generally be 
performed in the absence of such an ion. 

More preferably, however, the contacting and incubating steps 
(a) and (b) occur outside any bacterial cell, and the method 
comprises the further step (bl) of transferring the transposed 
DNA of the transposition mixture into the bacterial donor cell 
(preferably an E. coli cell) , prior to the conjugation step (c) 
into the host cell. This has the advantage that transposition 
into the donor cell genome will be avoided. In this case also, 
the incubating step (b) will be carried in the presence of 
transposase (and any necessary metal ion, such as Mg^"" for Tn5 
transposase, or the hyperactive mutant thereof) . The step (bl) 
of transferring the transposed DNA of the transposition mixture 
into the donor cell may for example be accomplished by 
electroporation using the transposition mixture (optionally 
after stopping the transposition reaction, e.g. by denaturing 
the transposase) . Following step (bl) , the method preferably 
includes the step (b2) of detecting whether the donor cell has 
taken up transposed DNA. This may involve identifying in the 
donor cell the presence of a selectable marker gene included 
within the construct of the invention. 

30 Particularly when the sequence of the DNA of interest is known 

(as it is for e.g. S. coelicolor A3 (2)), the method may comprise 
an additional step of identifying the site in the DNA of 
interest at which a transposition event has led to an insertion. 
This may involve sequencing, preferably using a sequencing 

35 primer that binds to a site within the construct of the 

invention. The first sequence data will then correspond to a 
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partial sequence of the construct, ending with one of the 
inverted repeat sequences. This will be followed by sequence 
data corresponding to the insertion site in the DNA of interest. 
Following a transposition event, the insertion site in the DNA 
5 of interest will be duplicated, the two copies of the insertion 
site being separated by the inverted repeat sequences and all 
sequences of the construct that lie between the inverted repeat 
sequences . 

10 The insertion site can be identified after conjugation into the 
host cell and homologous recombination with the native gene. 
However, this requires isolation and manipulation of genomic 
host DNA, rather than the DNA of interest, which may be of 
smaller scale than the entire host genome. Accordingly, the 

15 insertion site is preferably identified before the conjugation 
step. 

The construct may be designed to include one or more suitable 
sequencing primer binding sites, preferably located close to the 
20 inverted repeat sequences. 

Preferably the DNA of interest is a DNA from a bacterial 
library. Preferably the bacterium from which the library is 
generated is an actinomycete, more preferably a streptomycete, 
25 more preferably a bacterium of the genus Streptomyces , more 

preferably of the species S. coelicolor, more preferably of the 
strain S. coelicolor A3 (2) . 

For transfer of transposed DNA of interest bearing an RP4 oriT 
30 to occur by conjugation, a transfer function should be supplied, 
preferably in trans, e.g. by an E. coli donor strain such as 
ET12567 carrying the self -transmissible pUB307 (Bennett et al., 
1977, Flett et al., 1997) or ET12567 carrying the non- 
transmissible PUZ8002 (Kieser et al., 2000). 

35 
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Thus step (c) preferably includes such provision in trans of 
transfer function. This step may involve transforming the 
transposed DNA of interest (e.g. transposed cosmid) into a donor 
strain carrying a non- transmissible transfer plasmid (e.g. 
5 ET12567/pUZ8002) , followed by incubation under suitable 
conditions with the host cell. 

The host cell is preferably an actinomycete cell, more 
preferably a streptomycete cell, more preferably a cell of the 
10 genus Streptomyces , more preferably a cell of the species S. 

coelicolor, more preferably a cell of the strain S. coelicolor 
A3 (2) . The host cell is preferably a pre-germinated spore. 

The host cell will frequently be of the species or strain from 
15 which the DNA of interest originates (particularly S. 

coelicolor) . It is a particular advantage of the invention, 
however, that the same mutation can be introduced by conjugation 
into different host cells, which may be of different species or 
strains, such as different streptomycete strains. 

20 

For certain embodiments, which use an actinomycete bacterial 
host cell, the present invention provides the advantages that 
introduction of the mutagenised genetic material can be 
accomplished without protoplast transformation and regeneration, 

25 using conjugation into pre-germinated spores, and that 

(especially with conjugation from E. coli) the procedure is 
broadly applicable in introducing DNA into actinomycetes other 
than S. coelicolor (Matsushima et al . , 1994). As well as being 
laborious, protoplast transformation and regeneration procedures 

30 may produce mutations . 

The donor cell is preferably of a different cell type (i.e. of a 
different bacterial class, or higher taxonomical ranking) from 
the host cell; any convenient bacterial cell may be used. For 
35 convenience, however, the donor cell is most preferably an E. 
coli cell. 
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Where the host cell has methylation- specific restriction system 
(e.g. S. coelicolor has such a system, although the related 
strain S. lividans does not, MacNeil et al., 1992), the donor 
5 cell is preferably methylation-def icient , e.g. £• coli strain 
ET12567 (MacNeil et al., 1992). 

The method may comprise an additional step (e) of detecting 
whether homologous recombination has occurred in the host cell. 
10 This may for example be indicated by the loss in the host cell 
of a selectable marker that is borne by the DNA (e.g. cosmid) of 
interest, but the retention of a selectable marker that is borne 
by the construct of the invention. This may for example be 
determined by replica plating. 

15 

The method may comprise an additional step, prior to the 
conjugation step, of replacing part or all of the transposition- 
derived insert by a further step of homologous recombination, to 
remove sequences from the insert and/or to add sequences to the 

20 insert, e.g. to include different marker genes in the insertion 
and/or to generate an in- frame translational fusion of an 
interrupted host cell coding sequence and a reporter gene in the 
insertion (e.g. by removing the translational stop sequence and 
reporter gene ribosome binding sequence) . This step is 

25 preferably performed in a cell (preferably an E, coli cell) 
induced for Red-mediated recombination, e.g. on a transposed 
cosmid that has been transformed into the cell. 

The mutant host cells produced according to this method may be 
30 stored for future use, in any suitable form, e.g. (when the host 
cell is an actinomycete) as spores. However, it may be more 
convenient to store mutagenised DNA of interest in the donor 
cells, other transformed bacterial cells not necessarily 
suitable for use as a conjugation donor (e.g. E. coli cells) or 
35 simply as DNA, e.g. in the form of isolated cosmids. 
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Accordingly, in a third aspect, the invention provides a method 
for mutagenising DNA of interest of a bacterial species, the 
method comprising the steps of: 

contacting said DNA of interest with a nucleic acid 
5 construct of the invention, to form a transposition mixture; 

incubating the transposition mixture under conditions 
suitable for transposition to occur, said contacting and 
incubating steps being performed other than within cells of said 
bacterial species; and 

10 storing transposed DNA of said transposition mixture for 

future use in a method comprising transferring said transposed 
DNA from a donor bacterial cell into a host bacterial cell by 
conjugation and incubating the host cell under conditions 
suitable for homologous recombination between the transposed DNA 

15 and the DNA of the host cell. 

In a fourth aspect, the invention provides a method for 
mutagenising DNA of interest of a bacterial species, the method 
comprising, following the production and storage of transposed 
20 DNA according to the third aspect of the invention, the steps 
of: 

(c) transferring said transposed DNA by conjugation from a 
donor bacterial cell into a host bacterial cell; and 

(d) incubating the host cell under conditions suitable for 
25 homologous recombination between the transposed DNA and the DNA 

of the host cell. 

The present invention also provides a host cell producible or as 
produced by the process of the second and/or fourth aspect. 
30 Furthermore, the invention provides a method of determining the 
effect of a genetic disruption, the method comprising culturing 
such a host cell and determining the effect of the disruption on 
the cell. 

35 The present invention also provides transposed DNA of interest 
producible or as produced by the process of the third aspect, 
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optionally contained in a bacterial cell or cells (e.g. E, 
coli) . 

Preferably the method of the second aspect will be carried out 
5 simultaneously on several DNA molecules of interest (e.g. copies 
of a cosmid) , which are conjugated from different donor cells 
into different host cells, to produce a plurality of 
independently mutated host cells. 

10 Except where the context requires otherwise, all preferred 

features referred to herein are applicable to all aspects of the 
invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Embodiments of the invention will now be described, by way of 
example only, with reference to the accompanying drawings, in 
which: 

Figures lA and IB show the sequence of transposon Tn5062, a 
20 nucleic acid construct according to the first aspect of the 
invention, along with the location of various components; 

Figure 2 shows the construction strategy for Tn5062. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

EXAMPLE 

30 Summary 

A procedure for efficient systematic mutagenesis of 
streptomycete genes is described. The technique employs in vitro 
transposon mutagenesis, using a novel transposon Tn5052. 
35 Mutations are initially derived in cloned streptomycete DNA 

propagated in Escherichia coli. The mutations are then mobilised 
into a streptomycete host in which marker replacement by 
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homologous recombination occurs. The incorporation of a 
promoter- less copy of the eGFP reporter gene in Tn5062 permits 
temporal and spatial analysis of expression of a transposon 
tagged gene. 

5 

Bacterial strains, plasmids and oligonucleotides 

Streptomyces coelicolor iyil45 was used as a test strain for 
transposition and was cultured on SFM agar using standard 

10 techniques according to Kieser et al. (2000) • All DNA 

manipulations were carried out using Escherichia coli JM109 
(Yanisch- Perron et al . , 1985) or Escherichia coli Sure 
(Stratagene) . E.coli was grown on either L agar plates, L broth 
or 2xYT broth (Sambrook et al. 1989). E.coli ET12567 (MacNeil et 

15 al., 1992) carrying pUZ8002 (Kieser et al., 2000) was used as a 
host to mobilize transposed cosmids into S .coelicolor M145. 
Oligonucleotides and plasmids/cosmids used in this study are 
listed in tables 1 and 2 respectively, 

20 DNA manipulations 

Plasmid isolations were carried out using Wizard SV kits from 
Promega and cloning steps were performed using established 
procedures. Restriction endonucleases and T4 ligase (obtained 
25 from New England Biolabs, Life Technologies or Promega) were 
used according to the manufacturer's instructions. 

Construction of Tn5062 

30 The first step in the construction of the transposon, TnS052 

(Fig. 1) , was to clone eGFP from pEGFP-Nl to pALTERl as a 787bp 
Hindlll-Xbal fragment creating pFPll (Fig. 2) . This allowed an 
Ndel site to be introduced at the start codon of eGFP by site 
directed mutagenesis using the altered sites kit from Promega 

35 according to the manufacturer's instructions resulting in pFP12 
(ACCATG (pFPll) was changed to CATATG (pFP12) ; where ATG is the 
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first codon of the eGFP gene) . The three frame translational 
stop was constructed as a linker made from the oligonucleotides 
VCl and VC2 (MWG-Biotech) and cloned into Bglll-Ndel digested 
pET26B+ creating pVClOl, This was digested with Ndel and Xhol 
5 and ligated to a second linker synthesised as the 

oligonucleotides VC3 and VC4 carrying a Streptomyces consensus 
ribosome binding site creating pVC102. eGFP was cloned from 
pFP12 into pVC102 as 725bp Ndel-Eagl fragment giving pVClOV. 
aac3{IV) was first moved to pALTERl from pHP45Qaac as a 1783 bp 

10 EcoRI fragment creating pQMSOl, This plasmid was digested with 
Smal and the 1794bp aac3(IV) fragment introduced between the TnS 
inverted repeats of p]yiOD<MCS> by blunt-ended ligation with 
EcoICRI and Hindi digested pMOD<MCS> resulting in pQM504. oriT 
was introduced into pQM504 as a 786bp psti fragment from pIJ8660 

15 giving pQM5052 . Finally eGFP was added to pQM5052 as a 782bp 
EcoRI fragment from pVC107. This plasmid was named pQiyi5062 and 
allows Tn5052 to be liberated from the plasmid backbone by 
digestion with PvuII as a 3442 bp fragment (Fig.l), The sequence 
of the transposon was verified by restriction digestion with 

20 appropriate enzymes and sequencing using a Beckman- Coulter CEQ 
2000XL sequencer according to the manufacturer's instructions. 

Cosmid DNA isolation 

25 Selected (Table 2) cosmids from the S . coelicolor A3 (2) cosmid 

library (Redenbach et al., 1996) were obtained from Helen Kieser 
(John Innes Centre, Norwich, UK) as E.coli Sure cultures. Cosmid 
DNA was isolated from E.coli Sure using Wizard SV minipreps 
(Promega) according to the manufacturer's instructions- Cultures 

30 were grown at 37 in L broth containing ampicillin (50 /xg/ml) 
and kanamycin (25/^g/ml) and isolated DNA transformed into E.coli 
JM109 by electroporation using a Biorad Gene Pulser according to 
the manufacturer's instructions on L agar plates containing 
ampicillin (50 jug/ml) and kanamycin (25/xg/ml) . Cosmid DNA was 

35 again isolated using Wizard SV kits according to the 

manufactures instructions except that cultures were incubated in 
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2xYT broth containing ampicillin (50 /xg/ml) and kanamycin 
(25/xg/tnl) for exactly 18 hours at 250rpm. DNA was eluted from 
the spin column twice with 40/^1 of lOmM Tris-HCL, pH 8.5 
preheated to 50°C, quantified spectrophotometrically (OD260) 
5 using a Beckman DU 650 spectrophotometer and stored at -70 ^C. 

Purification of Tn5062 DNA 

PQM5062 DNA was isolated using the Wizard SV minipreps (Promega) 
10 according to the manufacturer's instructions after growth in L 
broth containing ampicillin (50/ig/ml) and apramycin (100//g/ml) . 
Tn5062 was liberated from the plasmid by digestion with Pvull 
and electrophoresis on a 1% agarose gel made with TAE buffer. 
Following electrophoresis the 3442bp Tn5062 band was excised 
15 from the gel using a scalpel and purified using the QIAEX II gel 
extraction system (Qiagen) according to manufacturer's 
instructions. Following purification, Tn5062 was further 
purified using the QIAquick PGR purification kit (Qiagen) and 
ethanol precipitated before being resuspended in 10 (il of 
20 sterile distilled water and quantified by comparison with known 
standards after agarose gel electrophoresis. Finally DNA was 
stored at -70°C. 

Transposition Reaction 

25 

Transposition of Tn5062 into selected cosmids (Table 2) from the 
S .coelicolor cosmid library (Redenbach, at al., 1996) was 
carried out by preparing the following reaction mix, in the 
listed order, in an Eppendorf tube according to the 
30 manufacturer's instructions (Epicentre) and incubated for 2 
hours at 37°C: 

EZ::Tn lOx Reaction Buffer 1/xl 

S .coelicolor A3 (2) cosmid DNA 2.9/xl (200ng or 7.6x10'^ 

35 /xMoles) 
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Tn5062 DNA 



1.2/xl {17.5ng or 7.6x10 
piMoles) 



Sterile distilled water 



3.9/il 
1/il 



EZ::TN Transposase 



5 



After completion, the reaction was stopped by adding 1/xl of 
EZ::TN stop solution and incubated at 70°C for 10 minutes. 1/xl 
of the transposition reaction was then added to 40/xl of 
electrocompetent E.coli JM109 cells (prepared according to 

10 manufacturer's instructions) and electroporated using a Biorad 
Gene Pulser according to the manufacturer's instructions. 
Following electroporation cells were plated on L agar 
supplemented with ampicillin (50/xg/ml) , kanamycin (25/ig/ml) and 
apramycin (lOO/xg/ml) to select for colonies containing 

15 transposed cosmids. 

Isolation of transposed cosmid DNA 

96 ampicillin, kanamycin and apramycin resistant colonies were 
20 picked and inoculated to a 96 square well growth block (ABgene) , 
each well containing 1ml of L broth containing ampicillin 
{50/zg/ml) , kanamycin (25/ig/ml) and apramycin (lOO/xg/ml) . The 
block was then incubated overnight at 37°C, 225 rpm. The next 
day 1.3/xl of each of the 96 cultures was the transferred to a 
25 second growth block, each well containing 1.3ml of 2xYT 

supplemented with apramycin (100/xg/ml) and incubated for 18 
hours at 37^C, 225 rpm. 330/xl of 60% (w/v) glycerol was then 
added to each of the 96 wells from the first growth block, mixed 
and stored at -70 °C. Cosmid DNA was isolated from the cultures 
30 in the second growth block using the Wizard SV 96 kit from 
Promega and stored at -70®C. 

Identification of transposition target sites by sequencing 



35 



Transposed cosmid DNA (1/xl) was first electrophoresed on a 

0.7 (w/v) agarose gel to check DNA quality. For sequencing 10 /xl 
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from each of the 96 samples was transferred to a 96 well PGR 
plate (ABgene) and heated to 86 tor 5 minutes in a MJ Research 
PTC-200 DNA engine. To each sample was then added 2 fil of 
transposon specific sequencing primer EZRl (10pmol//xl) (Tablel, 
5 Fig. 1) and 8/xl of CEQ DTCS quick start sequencing kit (Beckman- 
Coulter) . The sequencing reactions were then carried out in a MJ 
Research PTC-200 DNA engine by heating to 96 °C (20 seconds) , 
55^C (20 seconds) and 60°C (4 minutes) for 50 cycles. The 
samples were then analysed on a CEQ 200XL sequencer (Beckman- 

10 Coulter) using the long fast read program according to the 
manufacturer's conditions. Following sequencing the 
transposition target site was determined by comparison of each 
of the 96 sequences with the S.coelicolor A3 {2) genome sequence 
at http : //www. Sanger . ac .uk/Projects/ S coelicolor/ (Bentley et 

15 al. 2002) , Identified insertions in cosmid SC7C7 are shown in 
Table 3 . 

Transfer of insertion to S. coelicolor A3 (2) 

20 Replacement of a wild type gene with the cosmid-borne transposed 
copy was carried out by conjugation from E.coli according to 
Kieser et al . (2000). E.coli ET12567 (pUZ8002) was grown in the 
presence of kanamycin (25/xg/ml) and chloramphenicol (25 /xg/ml) 
and chemically competent cells prepared according to Sambrook et 

25 al. (1989). Selected transposed cosmids were then transformed 
into these cells (Sambrook et al., 1989) and grown on L agar 
supplemented with ampicillin (50/xg/ml) , kanamycin (25/xg/ml) and 
apramycin (lOO/xg/ml) to select for colonies containing 
transposed cosmids. The next day a single transformant was 

30 inoculated into 10 ml of L broth containing apramycin 

(lOO/ig/ml) , kanamycin (25/xg/ml) and chloramphenicol (25 /xg/ml) 
and grown overnight at 37°C, 250 rpm. The next day 0.4ml of the 
overnight culture was added to 39.6 ml of L broth supplemented 
with apramycin (100/xg/ml) , kanamycin (25//g/ml) and 

35 chloramphenicol (25 /xg/ml) , grown at 37 °C to an optical density 
of ODsoo 0.4-0.6. At this point, cells were harvested by 
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centrifugation and washed twice in 40 ml of L broth, before 
being resuspended in 4 ml of L broth. Meanwhile, approximately 
1x10^ of S.coelicolor M145 spores were added to 500/zl of 2xYT, 
heat shocked (50^C, 10 minutes) and allowed to cool. To this 
5 500/xl of pregerminated spores was added 500/il of the E.coli 

cells containing the transposed cosmid, after mixing, the cells 
and spores were centrifuged, most of the supernatant fraction 
removed and the pellet resuspended in the residual liquid. This 
was then plated on SFM agar, supplemented with lOmM MgCl2 and 

10 incubated at 30°C for 16 hours. The next day the plate was 

overlayed with 1 ml of sterile distilled water supplemented with 
Img of apramycin and 0.5mg nalidixic acid and incubated at 30 
for a further 3-4 days. After this time, individual 
transconjugants were picked off and patched onto SFM agar 

15 supplemented with nalidixic acid (25/xg/ml) and apramycin (100 
/xg/ml) , similarly colonies were also patched onto SFM agar 
supplemented with nalidixic acid (25/ig/ml) and kanamycin (25 
//g/ml) . Those colonies that had undergone a gene replacement and 
replaced the wild type gene with the cosmid borne copy 

20 containing the insertion sequence were identified on the basis 
of apramycin resistance and kanamycin sensitivity. 

Detection of eGFP expression in S.coelicolor A3 (2) 

25 Sterile coverslips were inserted into SFM agar at a 45° angle 
and 10 111 of S .coelicolor A3 (2) spores were inoculated in the 
acute angle between coverslip and agar surface. After incubation 
for 1-7 days, coverslips were removed and washed twice by brief 
immersion in methanol. After drying, the coverslips were mounted 

30 on slides and examined microscopically using a Nikon Eclipse 
E600 fluorescence microscope. eGFP expression was observed by 
illumination with ultra violet light and fluorescence visualised 
with a FITC filter set. 

35 Although the foregoing invention has been described in some 
detail by way of illustration and example for purposes of 
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clarity and understanding, it will be readily apparent to those 
of ordinary skill in the art in light of the teachings of this 
invention that certain changes and modifications may be made 
thereto without departing from the spirit or scope of the 
appended claims. 

For further information on techniques and materials, the 
addressee is referred also to general reference texts, such as 
Sambrook et al (1989), Kieser et al. (2000), Ausubel et al . 
(1989) , and any later editions thereof (such as Sambrook and 
Russell (2001) , as well as to the product literature of 
Epicentre and other suppliers of commercially available 
transposition systems . 

Each publication and earlier application referred to herein is 
hereby incorporated by reference in its entirety and for all 
purposes . 

Table 1 Oligonucleotides 



Oligo 


Sequence 


VCl 


GATCTGAATTCGGATCCTAATTAATTAATCTAGAAAGX3AGGTGATCA 


VC2 


TATGATCACCTCCTTTCTAGATTAATTAATTAGGATCCGAATTCA 


VC3 


TATGGACGGAGCTCGGCCGCTTAAGGTACCGAATTCC 


VC4 


TCGAGGAATTCGGTACCTTAAGCGGCCGAGCTCCGTCCA 


EZRl 


ATGCGCTCCATCAAGAAGAG 
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Table 2 Plasmids and cosmlds 



Plasmid 


Source 


pET26B+ 


Novagen 


pALTERl 


Promega 


pEGFP-Nl 


Clontech 


pHP45Qaac 


Blondelet-Rouault et al. 
(1997) 


pMOD<MCS > 


Epicentre 


PIJ8660 


Sun et al. (1999) 


pVClOl 


This work 


pVC102 


This work 


pVC107 


This work 


PQM501 


This work 


PQM504 


This work 


PQM5052 


This work 


PQM5062 


This work 


PUZ8002 


Kieser et al. (2000) 


SC1A6 


Redenbach et al. (1996) 


SC3A3 


Redenbach et al. (1996) 


SC6A9 


Redenbach et al. (1996) 


SC7B7 


Redenbach et al. (1996) 


SC7C7 


Redenbach et al. (1996) 


SCE59 


Redenbach et al. (1996) 


SCF91 


Redenbach et al. (1996) 


SCH63 


Redenbach et al. (1996) 


SCH69 


Redenbach et al. (1996) 


SCI7 


Redenbach et al. (1996) 


SC4G10 


Redenbach et al . (1996) 


SC4B10 


Redenbach et al. (1996) 


SCH66 


Redenbach et al. (1996) 


SC2E9 


Redenbach et al . (1996) 


SC9B5 


Redenbach et al. (1996) 


SCI51 


Redenbach et al. (1996) 


2SCI34 


Redenbach et al. (1996) 


2SCG38 


Redenbach et al. (1996) 


sees 8 


Redenbach et al . (1996) 


SCC77 


Redenbach et al. (1996) 


SC9E12 


Redenbach et al . (1996) 


SCF4 3 


Redenbach et al . (1996) 


SC5C11 


Redenbach et al . (1996) 


2SCK8 


Redenbach et al. (1996) 


SCH44 


Redenbach et al. (1996) 


SC10F4 


Redenbach et al. (1996) 


SCD66 


Redenbach et al. (1996) 


SCE20 


Redenbach et al. (1996) 


SCD16A 


Redenbach et al. (1996) 


SCH22A 


Redenbach et al. (1996) 


SCF55 


Redenbach et al. (1996) 


SC3C3 


Redenbach et al . (1996) 


SC2A11 


Redenbach et al, (19 96) 
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Table 3 Examples of transcriptional TnS062 insertions in cosmid SC7C7 



Cosmid 


Trans- 
posant 


Insertion 

Position 

(Uenome) 


- 

Insertion 

Position 

(v^osmia) 


Inserted ORF 


Insertion 
Strand 

(+/-) 


Potential 
eOFP 

Transcnption 

(y/n/?) 


Target Site 










lijoorKiNA ozoyyyz...oz /iMy + rmb lob 




y 


CCCTl Cj 1 CjCj 






OZ / 1 jH 1 


t>40 


iJDoorKJMA ozoyyyz...oz / 1 J ly + rmb loo 


1 

-r 


y 


GuTGAATAC 




A 1 n 


DZ / 144o 




ojoorKiNA ozoyyyz...oz/i3iy + rmii loo 




n 


ACCTTCGAC 




OU/ 




1 Jo/ 


1 -7 r /TO -.TJXT A /COOIOAA /COT/I Q 1 Q _l_ HT 0*^0 

1 J joo rKJN a OZ / i oUU-.oz /4y 1 y -r rmb zjb 




n 


LG 1 A 1 ACGG 






A071QCQ 


zzy4 


ijjoo rKJNA OZ / I0UU...0Z /4y ly + rmbzib 


1 


y 


U 1 OCG 1 AAT 




IJ 1 n 




zJjO 


1 J^oo rKJNA OZ /I0UU...0Z /4yiy + rmb 2 Jo 


1 


y 


CGCCGAAGT 




r^i 1 


dZ i52Ay 


zDj4 


1 "2 C/CQ vOXT A /CT710AA /00/lfl10_L . ■ .iT? O 

1 jdoo rKJNA OZ /loUU...Dz/4y ly + rmb z3o 


1 

■r 


y 


GGGTAAGTC 








ZJOO 


1 -jCiTO _¥>xT A /OTIOAA /CT7ylOlQ_i_ ■ .nr? O^P 

1 jooo rKNA OZ/ 10UU...0Z /4yiy + rmb z Jo 




n 


GC I GTGGGC 


/ 


Jtiuy 


Dz /jjzy 


z0j4 


1 JjOo rKJNA OZ / I0UU...0Z /4y ly T- rmbzJo 


+ 


y 


/^T"/^ A A A A 

Cj 1 GAAAG A 1 






OZ /3 / /O 


3Uol 


tTC/CQ-DXTA /COTIQAA /COO/IQIQ-L ■ nr? QIC 

1 Jjoo rKJNA oz/ioUU...Dz/4yiy + rmb zJo 




n 


I GGCTGGTG 




CIO 

biz 


0Z/JOZ3 


J Izo 


1 Jjoo rKNA OZ/ loUU..,oz74yiy + rmb z Jo 




n 


GGGTTACGG 






DZ/44JD 


J /ol 


1 Jjoo rKJNA OZ / ioUU...oz/4yiy + rmb z Jo 




n 


GC 1111 ATG 


SC7C7 


EOl 


6274937 


4242 






? 


CAGTGGACG 


SC7C7 


A04 


6274964 


4269 






? 


GGGTTGTTC 


SC7C7 


E04 


6275179 


4484 






? 


TTCCGTCAC 


SC7C7 


EOS 


6275630 


4935 


13571 SC05746, SC7C7.01 
6275223...6275900 + hypothetical protein 
SC7C7,01 




n 


ACTGCTGAT 


SC7C7 


E02 


6275925 


5230 




+ 


9 


GGCTTGTTC 


SC7C7 


Dll 


6277043 


6348 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 
protein 




y 


GGCCCGACC 


SC7C7 


COS 


6277305 


6610 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 
protein 




n 


GGTCGGGAC 


SC7C7 


£08 


6277864 


7169 


13573 SC05747, SC7C7.02c 
6276106.,.6278856 - putative regulatory 
protein 




n 


CCGATGAAC 


SC7C7 


A05 


6277949 


7254 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 

protein 




y 


GTGCTGCAG 


SC7C7 


F03 


6277949 


7254 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 
protein 




y 


GTGCTGCAG 


SC7C7 


G06 


6278026 


7331 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 

protein 




n 


GCGTAGACC 


SC7C7 


All 


6278408 


7713 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 
protein 




y 


GCCGACCGC 


SC7C7 


H03 


6278566 


7871 


13573 SC05747, SC7C7.02c 
6276106...6278856 - putative regulatory 
protein 




n 


GCGTGGACC 


SC7C7 


D06 


6278734 


8039 


13573 SC05747, SC7C7.02c 
6276106.. .6278856 - putative regulatory 
protein 




y 


GTTCTGTGA 


SC7C7 


DIO 


6279071 


8376 






7 

• 


GATGAAGGT 


SC7C7 


B04 


6281301 


10606 


13576 SC05748, SC7C7.03 
6279265. ..6284754 + putative sensory 
histidine kinase 




n 


GCCACACAC 


SC7C7 


C04 


6281495 


10800 


13576 SC05748, SC7C7.03 
6279265. ..6284754 + putative sensory 
histidine kinase 




y 


GGTCACGCG 


SC7C7 


F02 


6281668 


10973 


13576 SC05748, SC7C7.03 




n 


CCCTTGGCG 
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Costnid 


Trans- 
posant 


Insertion 

Position 

(Genome) 


Insertion 

Position 

(Cosmid) 


Inserted ORF 


Insertion 
Strand 

(+/-) 


Potential 
eOFP 

Transcnption 

(y/n/?) 


Target Site 










6279265. ..6284754 + putative sensory 
histidine kinase 








SC7C7 




6281898 


11203 


13576 SC05748, SC7C7.03 
6279265. ..6284754 + putative sensory 
histidine kinase 


+ 


y 


GTCCAGGTG 


SC7C7 


A09 


6282182 


11487 


13576 SC05748, SC7C7.03 
6279265. ..6284754 + putative sensory 
histidine kinase 




n 


CACCTGACC 


SC7C7 


G02 


6282297 


11602 


13576 SC05748, SC7C7.03 
6279265.. .6284754 + putative sensory 
histidine kinase 




y 


pACCAGCTC 


SC7C7 


A07 


6283214 


12519 


13576 SC05748. SC7C7.03 
6279265,. . 6284754 + putative sensory 

histidine kinase 


+ 


y 


CCAGTCGTC 


SC7C7 


HU 


6283268 


12573 


13576 SC05748, SC7C7.03 
6279265.. .6284754 + putative sensory 
histidine kinase 




n 


GTTCTGCTG 


SC7C7 


B06 


6283391 


12696 


13576 SC05748, SC7C7,03 
6279265. ..6284754 + putative sensory 
histidine kinase 




n 


GCTGCGTAC 


sC7C7 




H'^O AAAC\ 




13576 SC05748, SC7C7.03 
6279265... 6284754 + putative sensory 
nistiuine Kinase 


1 

■r 


y 




SC7C7 


D07 


6284914 


14219 




4- 


9 

• 


ACGTACGGG 


SC7C7 


H09 


6286360 


15665 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 


+ 


Y 


GTCTTCCGC 


SC7C7 


B03 


6286832 


16137 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 


■1- 


y 


CGGCCACCC 


SC7C7 


F05 


6287186 


16491 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 




y 


TCGCCGACC 


SC7C7 


D03 


6287254 


16559 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 




n 


GCACCGGCG 


SC7C7 


AOS 


6288131 


17436 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 




y 


ACTTCAACC 


SC7C7 


Fll 


6288234 


17539 


13583 SCO5750, SC7C7.05 
6286097...6288886 + ftsK homolog 




n 


GGCCAGCTC 


SC7C7 


F09 


6288477 


17782 


13583 SCO5750, SC7C7.05 
o25o0y7...o2ooooo + ttsK nomolog 


1 

T 


y 


CI 1 1 


SC7C7 


B02 


6289171 


18476 






• 


CGCTCGAAA 


SC7C7 


DOS 


62o9754 


19059 


13587 SCO5751,SC7C7.06 

6289 1 90.. .6290053 + putative membrane 

« 

protein 




n 


OLrC 1 1 OOvjO 


SC7C7 


H04 


6290058 


19363 




+ 


? 


GCGGGGACC 


SC7C7 


C03 


6290188 


19493 


13588 SC05752, SC7C7.07 

6290145. ,.6291626 + conserved hypothetical 

protein SC7C7.07 




n 


GGCGCAGCC 


SC7C7 


F07 


6291027 


20332 


13588 SC05752, SC1C1,Q1 

6290145. ..6291626 + conserved hypothetical 

protein SC7C7.07 




y 


ACTTCGACC 


SC7C7 


Ell 


6291102 


20407 


13588 SC05752, ^OnClSSl 

6290 145... 629 1626 + conserved hypothetical 

protein ^ClCl.Ql 




y 


TCCTGGAGC 


SC7C7 


E03 


6291533 


20838 


13588 SC05752, SClCl.Ol 

6290 145.,. 629 1626 + conserved hypothetical 

protein SC7C7.07 


- 


n 


CCATACGAC 


SC7C7 


BOl 


6292335 


21640 


13591 bC(J5753, pgsA o291d23... 6292414 + 

phosphatidylglycerophosphate synthase 




n 


GTCCAGGCC 
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Cosmid 


Trans- 
posant 


Insertion 
rosition 

^^vjcnome ) 


Insertion 
rosition 

^^x^osuiia 1 


Inserted ORF 


Insertion 
Strand 

(+/-) 


Potential 
eGFP 

Transcription 

(y/n//) 


Target Site 


SC7C7 


Bll 


6293516 


22821 






7 


GTTTTCGCA 


SC7C7 


A06 


6293979 


23284 


13598 SC05756, SC7C7.11 

o2yjo51... 5/94121 + nypotnetical protein 

SC7C7.11 




n 


GTCCACGAC 


SC7C7 


ElO 


6294345 


23650 


13601 SC05757, SC7C7.12 

0294 i2is... 0294394 + hypothetical protein 

SC7C7.12 




n 


GCGCAGGGC 


SC7C7 


C07 


6294633 


23938 


13602 SC05758, SC7C7.13 

6294450.., 0294824 + putative transcriptional 

regulator 


+ 


y 


TGGTCAAGG 


SC7C7 


D09 


6295168 




13604 SC05759, SC7C7.14 

o294o21.. .0295306 + hypothetical protein 

SC7C7.14 




n 


CGGTGAGCG 


SC7C7 


C09 


6295384 


24689 


13oUD c>CU576U, SC/C7.15C 
6295344...6296174 - DNA glycosylase 




n 


GTGCGGGCC 


SC7C7 


C08 


6295844 


25149 


I36U5 bCU5760, SC7C7.15C 
6295344...6296174 - DNA glycosylase 


+ 


n 


GTCCAGCAC 


SC7C7 


805 


6295934 


25239 


13605 SCO5760, SC7C7.15c 
6295344...6296174 - DNA glycosylase 




y 


GCGCTGGAG 










13605 bCU5760, bC7C7.15c 

DZ"j J*f*T...Dzyo 1 /H- - i-/iN/\ giycosyiasc 




n 






H06 


6296183 


25488 






? 


CCCTTGAGT 


SCICI 


G04 


6296468 


25773 


13606 SC05761, SC7C7.16c 

6290193.,. 630 1265 - putative A i r-aependent 
DNA helicase 




n 


GGAGCCCGC 


SC7C7 


BOS 


6296627 


25932 


13606 SC05761, SC7C7.16c 

6296 193... 630 1265 - putative ATr-dependent 

DNA helicase 




y 


GTACGACAC 


SC7C7 


A03 


6297137 


26442 


13606 SC05761, SC7C7,16c 

6296 193. ..6301265 - putative Air-dependent 

DNA helicase 




y 


CGAGGAGAG 


SC7C7 


C02 


6298130 


27435 


13606 SC05761, SC7C7.16c 

6296 193.. .630 1265 - putative Air-dependent 

DNA helicase 




n 


CGTGAAGGG 










13606 SC05761,SC7C7.16c 

6296 193.. .630 1 265 - putative A 1 F-dependent 

jjiNA neiicase 




n 




SC7C7 


FOl 


6301309 


30614 




- 


7 

• 


GCCACGCCC 


SC7C7 


G09 


6302647 


417 


13616 SCO5763,SC4H8.02 

Z'l A'^O 1 A AO AOA _I_ M..^<^4-I«>_ ■ - .. 1 

63023 19.. .6303089 + putative membrane 
protein SC4H8.02 




n 


CGGGAGGGC 


SC7C7 


B07 


6303689 


1459 


13619 SC05765, SC4H8.04c 

6303404. .,6304429 - hypothetical protein 

SC4H8.04C 




n 


AGCACGGCG 


SC7C7 


GOl 


6304465 


2235 


13621 SC05766, SC4H8.05 

6304454... 6304648 + hypothetical protein 

SC4H8.05 




n 


CCGTCAACC 


SC7C7 


G12 


6304465 


2235 


13621 SC05766, SC4H8.05 
ojU4HjH...oJU4o*to ^ nypoineucai proiein 
SC4H8.05 




n 


CCGTCAACC 
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